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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/855 f 34 OA 



DATE: 08/07/2002 
TIME: 10:53:29 



Input Set : A:\seqlist.txt 

Output Set: N:\CRF3\08072002\I855340A.raw 

3 <110> APPLICANT: Hosted, Jr., Thomas J. 

4 Horan, Ann C. 

6 <120> TITLE OF INVENTION: Isolation of Micromonospora carbonacea var africana 

7 pMLPl integrase and use of integrating function for 

8 site- specif ic integration into Micromonospora 

9 halophitica and Micromonospora carbonacea chromosome 
11 <130> FILE REFERENCE: IN01164K 

13 <140> CURRENT APPLICATION NUMBER:. 09/855, 340A 

C--> 14 <141> CURRENT FILING DATE: 2001-05-15 

16 <150> PRIOR APPLICATION NUMBER: 60/204,670 

17 <151> PRIOR FILING DATE: 2000-05-17 
19 <160> NUMBER OF SEQ ID NOS : 16 

21 <170> SOFTWARE: Patentln Ver. 2.1 

23 <210> SEQ ID NO: 1 

24 <211> LENGTH: 1179 

25 <212> TYPE: DNA 

2 6 <213> ORGANISM: Micromonospora carbonacea 

28 <400> SEQUENCE: 1 

29 gtgtggatcg agaagaacgg gcccgtctac cgcattcggg acctcgttcg cggtaaaaag 60 

30 gtcaccattc agaccggtta tccgacgaag accagcgcca agaatgcgat ggtgcagttc 120 

31 cgtgcggagc agttgcaggg caacgcgctc atgccgcgcg gcggtcagat taccctcgcc 180 

32 gatttcgtgg gggagtggtg gccgagctac gaaaagacgc tgaaaccgac cgccgtgaac 240 

33 tcggagggca accggatccg caaccacctc ctgcccatac tcggccatct cacccttgac 300 

34 gagctggacg ggcaggtcac ccagcagtgg gtcaacgacc tggaggccgg cgtcggcccg 360 

35 tggccggagt ccacgcgggg tcgtcggaag ccgctggcag cgaagacgat cagcaactgc 420 

36 cacggcctgc tgcacacgat ctgcggcgcg gcgatcgcgg cgaaacggat caggctcaac 480 

37 ccgtgctctt cgacgatgct gccccggcgc gagccgaaag agatgaagtt cctgagcgac 540 

38 ccggagatcg gtcggcttat cacggcgctt ccgccgcact ggcgaccgct cgtcatgctg 600 

39 ctggtggcga ccggtctgag gtggggtgag gcgatcggcc tgcgcgccgg ccgggtcgac 660 
4 0 ctgctcgccg cgcggccccg gctgaccgtc gtcgagcagc tccaggagct ggccagcacg 720 

41 ggagagctcg tcttccagtc gccgaagacc gcgaagggcc ggcgcacggt cagtttcacc 780 

42 acgaaagtcg ctctactgct tacgccactc atcgccggaa agaaaagtga cgaggtcgtg 840 
4 3 ttcaccgcgc cgaaaggcgg gatggtaagg acgcgcaatt tccggcggat ctgggtcaag 900 

44 gcgtgcgagg aagccgggct tccgggctta cgcattcacg atctgcggca cactcacgcg 960 

45 gcgatcctga tttctgccgg gcgtccgctg tcggcgatct cccgccgcct cggtcactcg 1020 

46 tcgatcgcgg tcacggatct gctgtacggg cacctgcgtg aggaggtcga cgaggggatc 1080 

47 ctcgcggcga tcgaggaggc gatggccggc gtccgggctg aggacctgga ggcggaactc 1140 

48 gacgaggagc tgacggacgt gttggccgac gcagcatga 1179 

51 <210> SEQ ID NO: 2 

52 <211> LENGTH: 426 

53 <212> TYPE: DNA 

54 <213> ORGANISM: Micromonospora carbonacea 
56 <400> SEQUENCE: 2 
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57 atgcgcaaca caccggggct ggggcgcggc acatgggccg 

58 gagcgcgccg gactgaccaa gagcgagttg gccaggcgca 

59 gtcggccggt gggaggacgg caagaaccgg cccgacgacg 

60 gcccaggtgc tcggcctcga cctcgacgaa gccctcgccg 

61 gtcaccccgc cagcgacccc aaccatggac ctggacgagg 

62 gaccccaagc tggacgagga catgaagcgg cgcatcatcg 

63 gagcgcgaca aggcggcggc gatcgaggaa accaagcggc 

64 agctga 

67 <210> SEQ ID NO: 3 

68 <211> LENGTH: 34 

69 <212> TYPE: DNA 

70 <213> ORGANISM: Micromonospora carbonacea 

72 <400> SEQUENCE: 3 

73 ccccggtacg ggttcaattc ccatcagtca cccg 

76 <210> SEQ ID NO: 4 

77 <211> LENGTH: 241 

78 <212> TYPE: DNA 

79 <213> ORGANISM: Micromonospora carbonacea 

81 <400> SEQUENCE: 4 

82 tattagtccg cacgccgccc ggccccgccg gagcggagcg 

83 tggcagagca ccgggttgtg gtcccggttg tcgtgggttc 

84 acacgaaggc cccctccact cggagggggc cttcggcgtt 

85 cggtcggctc ggcgctgggg gactcggccc cgtcggcggg 

86 a 

89 <210> SEQ ID NO: 5 

90 <211> LENGTH: 24 3 

91 <212> TYPE: DNA 

92 <213> ORGANISM: Micromonospora carbonacea 

94 <400> SEQUENCE: 5 

95 tggcgggggt gtggctatta ttagtccgca cgccgcccgg 

96 tggtggctgt agctcagttg gcagagcacc gggttgtggt 

97 ttcccatcag tcacccggca agtggatcta ctccacagca 

98 gggggcctga tgcgtcatag gggacaggta ggggaactca 

99 gtc 

102 <210> SEQ ID NO: 6 

103 <211> LENGTH: 247 

104 <212> TYPE: DNA 

105 <213> ORGANISM: Micromonospora carbonacea 

107 <400> SEQUENCE: 6 

108 taggggaatc cactccggag acgcccggag caatccggag 

109 gtcaggtggc ctgttgaccc cctgaccagg gccccggtac 

110 acccgtacac gaaggccccc tccactcgga gggggccttc 

111 gtcaggcggt cggctcggcg ctgggggact cggccccgtc 

112 ccgggga 

115 <210> SEQ ID NO: 7 

116 <211> LENGTH: 255 

117 <212> TYPE: DNA 

118 <213> ORGANISM: Micromonospora halophytica 
120 <400> SEQUENCE: 7 



catacgtcct 
tccagaagga 
cggacctcgt 
ccgcaggtct 
aaatcgagct 
ccctaatcct 
tcatcgacct 



caccgcccgc 60 
ccgggccacc 120 
tgcccgcgtc 180 
gcgccccggc 240 
ggtccgcacc 300 
ggagcgccgt 360 
gttccgccgg 4 20 
426 



34 



catggtggct gtagctcagt 60 
aattcccatc agtcacccgt 120 
cctgagggtt cgcggtcagg 180 
agtggcctcg gcgtccgggg 240 

241 



ccccgccgga gcggagcgca 60 
cccggttgtc gtgggttcaa 120 
gatcaggccc cctccgaaga 180 
acccccggct ccttgctcgc 240 

243 



catgacggag caaccagcag 60 
gggttcaatt cccatcagtc 120 
ggcgttcctg agggttcgcg 180 
ggcgggagtg gcctcggcgt 240 

247 



file://C:\CRF3\Outhold\VsrI855340A.htm 



8/7/02 



Page 3 of 7 



RAW SEQUENCE LISTING DATE: 08/07/2002 
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121 tttctccgca cccgcccggg gcgttcgacc gggtgcggcg gcatggtggc tgtagctcag 60 

122 ttggcagagc accgggttgt ggtcccggtt gtcgtgggtt caattcccat cagtcacccc 120 

123 aggtaagacc caggtcaggg ccggttctca ccggccctga cgcattttca ggggcatggt 180 

124 gggggcgcta ccgggggtgg ggtgtctcac cgcgagccag catctcgatc aggcgatcga 240 

125 gccggcgctg ccggg 255 

128 <210> SEQ ID NO: 8 

129 <211> LENGTH: 315 

130 <212> TYPE: DNA 

131 <213> ORGANISM: Micromonospora halophytica 

133 <400> SEQUENCE: 8 

134 tttctccgca cccgcccggg gcgttcgacc gggtgcggcg gcatggtggc tgtagctcag 60 
13 5 ttggcagagc accgggttgt ggtcccggtt gtcgtgggtt caattcccat cagtcacccg 120 
13 6 gcaagtggat ctactccaca gcagatcagg ccccctccga agagggggcc tgatgcgtca 180 

137 taggggacag gtaggggaac tcaacccccg gctccttgct cgcgtcgggt catgccgtcc 240 

138 gcgtacccct ccgcgtacct ggccctctcc cgttcctcga tctcggcggc gagctgatcg 300 

139 cgcaggtgcg cctcc 315 

142 <210> SEQ ID NO: 9 

143 <211> LENGTH: 260 

144 <212> TYPE: DNA 

145 <213> ORGANISM: Micromonospora halophytica 

147 <400> SEQUENCE: 9 

148 taggggaatc cactccggag acgcccggag caatccggag catgacggag caaccagcag 60 

149 gtcaggtggc ctgttgaccc cctgaccagg gccccggtac gggttcaatt cccatcagtc 120 

150 accccaggta agacccaggt cagggccggt tctcaccggc cctgacgcat tttcaggggc 180 

151 atggtggggg cgctaccggg ggtggggtgt ctcaccgcga gccagcatct cgatcaggcg 240 

152 atcgagccgg cgctgccggg 260 

154 <210> SEQ ID NO: 10 

155 <211> LENGTH: 209 

156 <212> TYPE: DNA 

157 <213> ORGANISM: artificial sequence 

159 <220> FEATURE: 

160 <223> OTHER INFORMATION: pMLPl attP region 

162 <400> SEQUENCE: 10 

163 taggggaatc cactccggag acgcccggag caatccggag catgacggag caaccagcag 60 
165 gtcaggtggc ctgttgaccc cctgaccagg gccccggtac gggttcaatt cccatcagtc 120 
167 acccggcaag tggatctact ccacagcaga tcaggccccc tccgaagagg gggcctgatg 180 
169 cgtcataggg gacaggtagg ggaactcaa 209 

172 <210> SEQ ID NO: 11 

173 <211> LENGTH: 19 

174 <212> TYPE: DNA 

175 <213> ORGANISM: artificial sequence 

177 <220> FEATURE: 

178 <223> OTHER INFORMATION: primer PR144 

180 <400> SEQUENCE: 11 

181 tgcttcgacg ccatcargg 19 

184 <210> SEQ ID NO: 12 

185 <211> LENGTH: 20 

186 <212> TYPE: DNA 

187 <213> ORGANISM: artificial sequence 
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PATENT APPLICATION: US/09/855 , 340A TIME: 10:53:29 

Input Set : A:\seqlist.txt 
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189 


<220> FEATURE : 




190 


<Z2o> OlHfcjK INrOKMAlIUM: primer PK14D 




192 


<zzU> FEAIUKE: 




TOO 


<zzl> NAMrj/KlijI : IulSC reaLure 




194 


<222> LOCATION: (7 )..(/) 




195 


<223> OTHER INFORMATION: n is inosine (I) 




198 


<400> SEQUENCE: 12 




W--> 199 


gtggaanccg ccgaakccgc 


z u 


201 


<210> SEQ ID NO: 13 




202 


<211> LENGTH: 20 




203 


<212> TYPE: DNA 




z(J4 


<zJLj> UKuANlbM: artiriciai sequence 




206 


<220> FEATURE: 




207 


<223> OTHER INFORMATION: primer PDH504 




209 


<400> SEQUENCE: 13 




210 


agggcaacaa gggaagcgtc 


ZU 


213 


<210> SEQ ID NO: 14 




214 


<Zi±> LENGTH : ^il 




215 


<212> TYPE: DNA 




216 


<213> ORGANISM: artificial sequence 




218 


<220> FEATURE: 




219 


<223> OTHER INFORMATION: primer PDH505 




221 


<400> SEQUENCE: 14 




222 


ggcgggggtg tggctattat t 


21 


225 


<210> SEQ ID NO: 15 




226 


<211> LENGTH: 21 




227 


<212> TYPE: PRT 




228 


<213> ORGANISM: artificial sequence 




230 


<220> FEATURE: 




231 


<223> OTHER INFORMATION: amino acid sequence of open 


reading frame indicated in 



figures 4b 

232 and 4d 

234 <400> SEQUENCE: 15 

236 Ser Pro Asp Ala Glu Ala Thr Pro Ala Asp Gly Ala Glu Ser Pro Ser 

237 15 10 15 

240 Ala Glu Pro Thr Ala 

241 20 

244 <210> SEQ ID NO: 16 

245 <211> LENGTH: 21 

246 <212> TYPE: PRT 

247 <213> ORGANISM: artificial sequence 

249 <220> FEATURE: 

250 <223> OTHER INFORMATION: amino acid sequence of open reading frame indicated in 
figures 5b 

251 and 5d 

253 <400> SEQUENCE: 16 

2 55 Arg Gin Arg Arg Leu Asp Arg Leu lie Glu Met Leu Ala Arg Gly Glu 

256 15 10 15 

259 Thr Pro His Pro Arg 

260 20 
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RAW SEQUENCE LISTING ERROR SUMMARY 
PATENT APPLICATION: US/09/855 , 340A 



DATE: 08/07/2002 
TIME: 10:53:30 



Input Set : A:\seqlist.txt 

Output Set: N:\CRF3\08072002\I855340A.raw 



Please Note: 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 

Seq#:12; N Pos . 7 
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VERIFICATION SUMMARY DATE: 08/07/2002 

PATENT APPLICATION: US/09/855, 340A TIME: 10:53:30 

Input Set : A:\seqlist.txt 

Output Set: N:\CRF3\08072002\I855340A.raw 

:14 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
:199 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 12 after pos . : 0 



| 
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